Search | WHO COVID-19 Research Database

Contrastive Learning Improves Critical Event Prediction in COVID-19 Patients (preprint)

Tingyi Wanyan; Hossein Honarvar; Suraj K. Jaladanki; Chengxi Zang; Nidhi Naik; Sulaiman Somani; Jessica K. De Freitas; Ishan Paranjpe; Akhil Vaid; Riccardo Miotto; Girish N. Nadkarni; Marinka Zitnik; Ariful Azad; Fei Wang; Ying Ding; Benjamin S. Glicksberg.

arxiv; 2021.

Preprint in English | PREPRINT-ARXIV | ID: ppzbmed-2101.04013v1

ABSTRACT

Machine Learning (ML) models typically require large-scale, balanced training data to be robust, generalizable, and effective in the context of healthcare. This has been a major issue for developing ML models for the coronavirus disease 2019 (COVID-19) pandemic, where data is highly imbalanced, particularly within electronic health records (EHR) research. Conventional approaches in ML use cross-entropy loss (CEL), which often suffers from poor margin classification. For the first time, we show that contrastive loss (CL) improves the performance of CEL, especially for imbalanced EHR data and the related COVID-19 analyses. This study has been approved by the Institutional Review Board at the Icahn School of Medicine at Mount Sinai. We use EHR data from five hospitals within the Mount Sinai Health System (MSHS) to predict mortality, intubation, and intensive care unit (ICU) transfer in hospitalized COVID-19 patients over 24 and 48 hour time windows. We train two sequential architectures (RNN and RETAIN) using two loss functions (CEL and CL). Models are tested on the full-sample data set, which contains all available data, and on a restricted data set that emulates higher class imbalance. CL models consistently outperform CEL models with the restricted data set on these tasks, with differences ranging from 0.04 to 0.15 for AUPRC and 0.05 to 0.1 for AUROC. For the restricted sample, only the CL model maintains proper clustering and is able to identify important features, such as pulse oximetry. CL outperforms CEL in instances of severe class imbalance, on three EHR outcomes with respect to three performance metrics: predictive power, clustering, and feature importance. We believe that the developed CL framework can be expanded and used for EHR ML work in general.
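
The abstract above does not spell out the exact form of the contrastive loss, so the following is only a rough sketch of the general idea: a generic supervised contrastive loss that pulls embeddings of same-label patients together and pushes different-label ones apart. The function name, the temperature value, and the toy embeddings are illustrative assumptions, not the authors' implementation.

```python
import numpy as np

def supervised_contrastive_loss(embeddings, labels, temperature=0.1):
    """Generic supervised contrastive loss over one batch (illustrative only)."""
    z = np.asarray(embeddings, dtype=float)
    y = np.asarray(labels)
    # L2-normalise so that dot products are cosine similarities.
    z = z / np.linalg.norm(z, axis=1, keepdims=True)
    sim = z @ z.T / temperature
    not_self = ~np.eye(len(y), dtype=bool)
    # Numerically stable log-softmax over all *other* samples in the batch.
    sim = sim - sim.max(axis=1, keepdims=True)
    exp_sim = np.exp(sim) * not_self
    log_prob = sim - np.log(exp_sim.sum(axis=1, keepdims=True))
    # Positives: other samples that share the anchor's label.
    positives = (y[:, None] == y[None, :]) & not_self
    pos_counts = positives.sum(axis=1)
    has_pos = pos_counts > 0
    if not has_pos.any():
        return 0.0  # no anchor has a positive partner in this batch
    per_anchor = -(log_prob * positives).sum(axis=1)[has_pos] / pos_counts[has_pos]
    return float(per_anchor.mean())

# Toy batch (assumed data): two tight groups of 2-D embeddings.
toy_embeddings = [[1.0, 0.0], [1.0, 0.01], [-1.0, 0.0], [-1.0, 0.01]]
loss_aligned = supervised_contrastive_loss(toy_embeddings, [1, 1, 0, 0])
loss_mixed = supervised_contrastive_loss(toy_embeddings, [1, 0, 1, 0])
```

When the labels agree with the geometric grouping, the loss is close to zero; when they cut across it, the loss is large. That is the pressure that encourages class-wise clustering of the learned representation.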

Subject(s)

COVID-19 , Coronavirus Infections , Extravasation of Diagnostic and Therapeutic Materials

Federated Learning of Electronic Health Records Improves Mortality Prediction in Patients Hospitalized with COVID-19 (preprint)

Akhil Vaid; Suraj K Jaladanki; Jie Xu; Shelly Teng; Arvind Kumar; Samuel Lee; Sulaiman Somani; Ishan Paranjpe; Jessica K De Freitas; Tingyi Wanyan; Kipp W Johnson; Mesude Bicak; Eyal Klang; Young Joon Kwon; Anthony Costa; Shan Zhao; Riccardo Miotto; Alexander W Charney; Erwin Böttinger; Zahi A Fayad; Girish N Nadkarni; Fei Wang; Benjamin S Glicksberg.

medrxiv; 2020.

Preprint in English | medRxiv | ID: ppzbmed-10.1101.2020.08.11.20172809

ABSTRACT

Machine learning (ML) models require large datasets which may be siloed across different healthcare institutions. Using federated learning, an ML technique that avoids locally aggregating raw clinical data across multiple institutions, we predict mortality within seven days in hospitalized COVID-19 patients. Patient data was collected from Electronic Health Records (EHRs) from five hospitals within the Mount Sinai Health System (MSHS). Logistic Regression with L1 regularization (LASSO) and Multilayer Perceptron (MLP) models were trained using local data at each site, a pooled model with combined data from all five sites, and a federated model that only shared parameters with a central aggregator. Both the federated LASSO and federated MLP models performed better than their local model counterparts at four hospitals. The federated MLP model also outperformed the federated LASSO model at all hospitals. Federated learning shows promise in COVID-19 EHR data to develop robust predictive models without compromising patient privacy.
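
To make the "only shared parameters with a central aggregator" idea concrete, here is a minimal sketch of one common scheme, sample-size-weighted parameter averaging (FedAvg-style), applied to plain logistic regression. The abstract does not state which aggregation rule was used, and this sketch omits the L1 penalty; the function names, hyperparameters, and synthetic data are all assumptions for illustration.

```python
import numpy as np

def local_update(weights, X, y, lr=0.1, epochs=5):
    """A few gradient-descent steps of logistic regression on one site's data."""
    w = weights.copy()
    for _ in range(epochs):
        p = 1.0 / (1.0 + np.exp(-(X @ w)))   # predicted probabilities
        w -= lr * X.T @ (p - y) / len(y)     # mean log-loss gradient step
    return w

def federated_round(global_w, sites, lr=0.1, epochs=5):
    """One round: each site trains locally; only parameters are shared."""
    updates = [local_update(global_w, X, y, lr, epochs) for X, y in sites]
    sizes = np.array([len(y) for _, y in sites], dtype=float)
    # Sample-size-weighted average of the site parameters.
    return np.average(updates, axis=0, weights=sizes)

def make_synthetic_sites(n_sites=5, seed=0):
    """Synthetic stand-in for per-hospital data (not real EHR data)."""
    rng = np.random.default_rng(seed)
    true_w = np.array([1.5, -2.0, 0.5])
    sites = []
    for _ in range(n_sites):
        n = int(rng.integers(50, 200))
        X = rng.normal(size=(n, 3))
        y = (rng.random(n) < 1.0 / (1.0 + np.exp(-(X @ true_w)))).astype(float)
        sites.append((X, y))
    return sites

sites = make_synthetic_sites()
w = np.zeros(3)
for _ in range(50):
    w = federated_round(w, sites)
```

Raw rows of `X` and `y` never leave `local_update`; the aggregator sees only the weight vectors, which is the property the abstract emphasises.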

Subject(s)

COVID-19

Machine Learning to Predict Mortality and Critical Events in COVID-19 Positive New York City Patients (preprint)

Akhil Vaid; Sulaiman Somani; Adam J Russak; Jessica K De Freitas; Fayzan F Chaudhry; Ishan Paranjpe; Kipp W Johnson; Samuel J Lee; Riccardo Miotto; Shan Zhao; Noam Beckmann; Nidhi Naik; Kodi Arfer; Arash Kia; Prem Timsina; Anuradha Lala; Manish Paranjpe; Patricia Glowe; Eddye Golden; Matteo Danieletto; Manbir Singh; Dara Meyer; Paul F O'Reilly; Laura H Huckins; Patricia Kovatch; Joseph Finkelstein; Robert M Freeman; Edgar Argulian; Andrew Kasarskis; Bethany Percha; Judith A Aberg; Emilia Bagiella; Carol R Horowitz; Barbara Murphy; Eric J Nestler; Eric E Schadt; Judy H Cho; Carlos Cordon-Cardo; Valentin Fuster; Dennis S Charney; David L Reich; Erwin P Bottinger; Matthew A Levin; Jagat Narula; Zahi A Fayad; Allan Just; Alexander W Charney; Girish N Nadkarni; Benjamin S Glicksberg.

medrxiv; 2020.

Preprint in English | medRxiv | ID: ppzbmed-10.1101.2020.04.26.20073411

ABSTRACT

Coronavirus disease 2019 (COVID-19), caused by the SARS-CoV-2 virus, has become the deadliest pandemic in modern history, reaching nearly every country worldwide and overwhelming healthcare institutions. As of April 20, there have been more than 2.4 million confirmed cases with over 160,000 deaths. Extreme case surges coupled with challenges in forecasting the clinical course of affected patients have necessitated thoughtful resource allocation and early identification of high-risk patients. However, effective methods for achieving this are lacking. In this paper, we present a decision tree-based machine learning model trained on electronic health records from patients with confirmed COVID-19 at a single center within the Mount Sinai Health System in New York City. We then externally validate our model by predicting the likelihood of a critical event or death within various time intervals for patients after hospitalization at four other hospitals and achieve strong performance, notably predicting mortality at 1 week with an AUC-ROC of 0.84. Finally, we establish model interpretability by calculating SHAP scores to identify decisive features, including age, inflammatory markers (procalcitonin and LDH), and coagulation parameters (PT, PTT, D-Dimer). To our knowledge, this is one of the first externally validated models that both predicts outcomes in COVID-19 patients with strong validation performance and identifies key contributors to outcome prediction that may assist clinicians in making effective patient management decisions.
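
For readers unfamiliar with the AUC-ROC figure quoted above, the following sketch shows one standard way to compute it: as the probability that a randomly chosen positive case receives a higher score than a randomly chosen negative case, with ties counted as half. The function name and the toy scores are illustrative assumptions; they are unrelated to the study's data.

```python
import numpy as np

def auroc(y_true, scores):
    """AUC-ROC via pairwise comparison of positive and negative scores."""
    y = np.asarray(y_true).astype(bool)
    s = np.asarray(scores, dtype=float)
    pos, neg = s[y], s[~y]
    if len(pos) == 0 or len(neg) == 0:
        raise ValueError("need at least one positive and one negative case")
    # Compare every positive score with every negative score.
    diff = pos[:, None] - neg[None, :]
    # A win counts 1, a tie counts 0.5, a loss counts 0.
    return float(((diff > 0) + 0.5 * (diff == 0)).mean())

# Toy example (assumed data): 3 of the 4 positive/negative pairs are ranked correctly.
toy_auc = auroc([0, 0, 1, 1], [0.1, 0.4, 0.35, 0.8])
```

A value of 0.5 corresponds to random ranking and 1.0 to perfect separation, so 0.84 means that in roughly 84% of positive/negative pairs the patient who experienced the outcome was scored higher.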

Subject(s)

COVID-19

Prevalence and Impact of Myocardial Injury in Patients Hospitalized with COVID-19 Infection (preprint)

Anuradha Lala; Kipp W Johnson; Adam J Russak; Ishan Paranjpe; Shan Zhao; Sulaiman Solani; Akhil Vaid; Fayzan Chaudhry; Jessica K De Freitas; Zahi A Fayad; Sean P Pinney; Matthew Levin; Alexander Charney; Emilia Bagiella; Jagat Narula; Benjamin S Glicksberg; Girish Nadkarni; James Januzzi; Donna M Mancini; Valentin Fuster.

medrxiv; 2020.

Preprint in English | medRxiv | ID: ppzbmed-10.1101.2020.04.20.20072702

ABSTRACT

Background: The degree of myocardial injury, reflected by troponin elevation, and associated outcomes among hospitalized patients with Coronavirus Disease (COVID-19) in the US are unknown. Objectives: To describe the degree of myocardial injury and associated outcomes in a large hospitalized cohort with laboratory-confirmed COVID-19. Methods: Patients with COVID-19 admitted to one of five Mount Sinai Health System hospitals in New York City between February 27th and April 12th, 2020 with troponin-I (normal value <0.03 ng/mL) measured within 24 hours of admission were included (n=2,736). Demographics, medical history, admission labs, and outcomes were captured from the hospital EHR. Results: The median age was 66.4 years, with 59.6% men. Cardiovascular disease (CVD), including coronary artery disease, atrial fibrillation, and heart failure, was more prevalent in patients with higher troponin concentrations, as were hypertension and diabetes. A total of 506 (18.5%) patients died during hospitalization. Even small amounts of myocardial injury (e.g. troponin I 0.03-0.09 ng/mL, n=455, 16.6%) were associated with death (adjusted HR: 1.77, 95% CI 1.39-2.26; P<0.001) while greater amounts (e.g. troponin I >0.09 ng/mL, n=530, 19.4%) were associated with more pronounced risk (adjusted HR 3.23, 95% CI 2.59-4.02). Conclusions: Myocardial injury is prevalent among patients hospitalized with COVID-19, and is associated with higher risk of mortality. Patients with CVD are more likely to have myocardial injury than patients without CVD. Troponin elevation likely reflects non-ischemic or secondary myocardial injury.

Subject(s)

Coronavirus Infections , Heart Failure , Cardiovascular Diseases , Diabetes Mellitus , Ischemia , Hypertension , Coronary Artery Disease , COVID-19 , Death , Cardiomyopathies , Atrial Fibrillation

Clinical Characteristics of Hospitalized Covid-19 Patients in New York City (preprint)

Ishan Paranjpe; Adam Russak; Jessica K De Freitas; Anuradha Lala; Riccardo Miotto; Akhil Vaid; Kipp W Johnson; Matteo Danieletto; Eddye Golden; Dara Meyer; Manbir Singh; Sulaiman Somani; Sayan Manna; Udit Nangia; Arjun Kapoor; Ross O'Hagan; Paul F O'Reilly; Laura M Huckins; Patricia Glowe; Arash Kia; Prem Timsina; Robert M Freeman; Matthew A Levin; Jeffrey Jhang; Adolfo Firpo; Patricia Kovatch; Joseph Finkelstein; Judith A Aberg; Emilia Bagiella; Carol R Horowitz; Barbara Murphy; Zahi A Fayad; Jagat Narula; Eric J Nestler; Valentin Fuster; Carlos Cordon-Cardo; Dennis S Charney; David L Reich; Allan C Just; Erwin P Bottinger; Alexander W Charney; Benjamin S Glicksberg; Girish Nadkarni; - Mount Sinai Covid Informatics Center (MSCIC).

medrxiv; 2020.

Preprint in English | medRxiv | ID: ppzbmed-10.1101.2020.04.19.20062117

ABSTRACT

Background: The coronavirus 2019 (Covid-19) pandemic is a global public health crisis, with over 1.6 million cases and 95,000 deaths worldwide. Data are needed regarding the clinical course of hospitalized patients, particularly in the United States. Methods: Demographic, clinical, and outcomes data for patients admitted to five Mount Sinai Health System hospitals with confirmed Covid-19 between February 27 and April 2, 2020 were identified through institutional electronic health records. We conducted a descriptive study of patients who had in-hospital mortality or were discharged alive. Results: A total of 2,199 patients with Covid-19 were hospitalized during the study period. As of April 2nd, 1,121 (51%) patients remained hospitalized, and 1,078 (49%) completed their hospital course. Of the latter, the overall mortality was 29%, and 36% required intensive care. The median age was 65 years overall and 75 years in those who died. Pre-existing conditions were present in 65% of those who died and 46% of those discharged. In those who died, the admission median lymphocyte percentage was 11.7%, D-dimer was 2.4 ug/ml, C-reactive protein was 162 mg/L, and procalcitonin was 0.44 ng/mL. In those discharged, the admission median lymphocyte percentage was 16.6%, D-dimer was 0.93 ug/ml, C-reactive protein was 79 mg/L, and procalcitonin was 0.09 ng/mL. Conclusions: This is the largest and most diverse case series of hospitalized patients with Covid-19 in the United States to date. Requirement of intensive care and mortality were high. Patients who died typically had pre-existing conditions and severe perturbations in inflammatory markers.

Subject(s)

COVID-19
